The early transcription region 3 (E3) of group B adenovirus type 35 (Ad35), a serotype isolated primarily
from patients with acquired immunodeficiency syndrome and other immunodeficiency disorders, has been
partially sequenced. We had previously identified an Ad35 29-kilodalton (kDa) early glycoprotein which,
analogous to group C Ad2 E3-19K, associated with major histocompatibility complex class I antigens in the
endoplasmic reticulum of infected cells. The open reading frame (ORF) of the Ad35 29-kDa protein has now
been identified within a 2-kilobase-pair cloned Ad35 E3 fragment. The predicted amino acid sequence was very
similar to that of group B Ad3 E3-19K. In contrast, homology between the Ad35 and Ad2 glycoproteins was
limited to five cysteines in identical positions and a 20-amino-acid region proximal to the transmembrane
domain. In addition, 20.3- and 20.6-kDa ORFs have been identified downstream from the ORF for the Ad35
glycoprotein. Analogous 20-kDa ORFs are present in the Ad3 E3 region but are not present in Ad2 and Ad5.
In contrast, the region analogous to an Ad2 11.6-kDa ORF, which is 9 kDa in size in Ad3, was absent from the
expected position within the Ad35 E3 region. Because the E3 region is likely to play an important role in the
interaction between virus and host, analysis of the function of the Ad35 E3 proteins should further our
understanding of adenovirus pathogenesis.



Adenovirus type 35 (Ad35) is a group B adenovirus that has
been isolated primarily from patients with acquired
immunodeficiency syndrome (AIDS) or other immunodeficiencies
(8, 28, 31). We have shown that over 10% of patients
with AIDS or AIDS-related complex surveyed at our institution
had urine cultures positive for Ad35 (19). In contrast,
this serotype was isolated rarely from normal individuals
with febrile illnesses, and specific Ad35 antibody was
found only at a very low titer in pooled gamma globulin
(31). We have therefore proposed that Ad35 may be unique
from other adenoviruses in its interaction with the immune
system. Currently, there is no well-established animal model
in which to test this hypothesis.
We have focused on the characterization of the Ad35 early
transcription region 3 (E3) because this region is likely to
play an important role in the interaction between the virus
and the host. It has been demonstrated that the group C Ad2
and Ad5 E3 regions code for a glycoprotein, E3-19K, which
associates with major histocompatibility complex (MHC)
class I antigens (22). In addition, the E3 region is not
essential for viral replication in vitro (20) yet appears to have
been conserved among all adenovirus serotypes studied to
date (4, 5, 16, 17, 26, 29). The Ad2 glycoprotein has been
localized to the endoplasmic reticulum (ER), where it interferes
with the transport of class I antigens to the cell surface.
Inhibition of the expression of class I antigens, which
are needed for cytotoxic T-lymphocyte recognition of virus-
infected cells (9), may enable adenovirus-infected cells to
evade immune recognition and facilitate latency. Besides
E3-19K, the Ad2 E3 region also contains eight other open
reading frames (ORFs) which could code for proteins important
in infection in vivo. Currently, only the proteins
coded by the 11.6-kilodalton (kDa) and 14.7-kDa
ORFs have been shown to be synthesized in adenovirus-
infected cells. (Another 11-kDa ORF is present in
the Ad2 E3 region but is predicted to be noncoding.)
Recently, the product of the 14.7-kDa ORF has been shown
to protect infected cells against lysis by tumor necrosis
factor (14).

We have previously identified and characterized a group B
Ad35 29-kDa early glycoprotein which associated with MHC
class I antigens in the ER, analogous to the Ad2 E3-19K (11).
In contrast to the Ad2 glycoprotein, the Ad35 glycoprotein,
E29, was synthesized in much smaller amounts, was more
extensively glycosylated, and did not cross-react with
polyclonal antibody against the Ad2 glycoprotein.

We now report the identification of the Ad35 E29 ORF
within the Ad35 E3 region. A 2-kilobase (kb) Ad35 E3
fragment from map coordinates 79.5 to 83 was cloned and
sequenced. Ad35 genomic DNA was obtained from
virions grown in HeLa cells infected with Ad35 Holden (30),
as previously described (18). The Ad35 BamHI A fragment
located between 60.5 and 85 map units (m.u.) (32) was cloned
into pUC12 by standard recombinant DNA techniques. The
right-hand end of the fragment, containing the E3 region,
was cloned by making use of one EcoRI restriction site at
79.5 m.u. and another within the multicloning site of the
plasmid, adjacent to the BamHI site at 60.5 m.u. The
construct was cut with EcoRI and religated, and the 60.5- to
79.5-m.u. EcoRI fragment was removed. A pUC12 plasmid
containing the remaining Ad35 fragment between the 79.5-
m.u. EcoRI site and the 85-m.u. BamHI site was obtained.
The Ad35 E3 fragment was cloned into both M13mp18 and
[...]
were subcloned into both M13 vectors, [...]
the IBI Cyclone protocol, based on the method of [...]
[...] DNA sequence in all three reading frames identified four
ORFs which have a capacity to code for proteins larger than 
6 kDa. The predicted proteins had molecular masses of 18.5,
6.4, 20.3, and 20.6 kDa. The 2-kb Ad35 E3 nucleotide 
sequence exhibited a high degree of homology to a region 
within the group B Ad3 E3 sequence published previously 
(29). The three largest Ad35 ORFs, 18.5, 20.3, and 20.6 kDa, 
are analogous to Ad3 ORFs previously identified. The nu- 
cleotide sequence of the Ad35 E3 fragment and the predicted 
amino acid sequences of the four ORFs, together with the 
alignment of the Ad3 sequences, are shown in Fig. 1. Figure 
2 shows a schematic representation and alignment of the 
Ad35 and Ad3 DNA sequences and ORFs. These data 
represent the first nucleotide sequence established for the 
unique group B Ad35. 
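
The kind of ORF search described above (all three reading frames, products larger than 6 kDa) can be sketched in a few lines of Python. This is a minimal illustration and not the procedure used for the Ad35 sequence: the function name, the ATG-to-stop definition of an ORF, the 110-Da average residue mass used to convert the mass cutoff into a codon count, and the toy sequence are all assumptions.

```python
# Sketch: scan the three forward reading frames of one DNA strand for ORFs
# whose predicted product exceeds a mass cutoff.
# Assumptions (not from the paper): an ORF runs from an ATG to the first
# in-frame stop codon, and the average residue mass is taken as 110 Da.

STOP_CODONS = {"TAA", "TAG", "TGA"}
AVG_RESIDUE_DA = 110.0  # rough average; real masses depend on composition


def find_orfs(dna, min_kda=6.0):
    """Return (frame, start, end, approx_kda) for each qualifying ORF.

    frame is 1, 2, or 3; start and end are 0-based and end-exclusive,
    and end includes the stop codon.
    """
    dna = dna.upper()
    min_codons = int(min_kda * 1000 / AVG_RESIDUE_DA)
    orfs = []
    for frame in range(3):
        start = None
        for i in range(frame, len(dna) - 2, 3):
            codon = dna[i:i + 3]
            if start is None and codon == "ATG":
                start = i  # first ATG opens a candidate ORF
            elif start is not None and codon in STOP_CODONS:
                n_codons = (i - start) // 3  # coding codons, stop excluded
                if n_codons >= min_codons:
                    kda = n_codons * AVG_RESIDUE_DA / 1000
                    orfs.append((frame + 1, start, i + 3, round(kda, 1)))
                start = None
    return orfs


if __name__ == "__main__":
    # Hypothetical toy sequence: ATG, 60 Ala codons, and a stop, offset by 1 nt.
    toy = "C" + "ATG" + "GCT" * 60 + "TAA" + "CC"
    print(find_orfs(toy))
```

A composition-based mass calculation would replace the flat 110-Da average in any real analysis; the flat value is used here only to keep the sketch short.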

Comparison of the Ad35 18.5-kDa ORF with the Ad3 
E3-19K ORF revealed approximately 80% identity at both 
the DNA and protein levels. Analysis of the predicted amino 
acid sequence of the Ad35 ORF revealed that there are four 
canonical glycosylation sites (Asn-X-Thr/Ser), an observa- 
tion which is consistent with our studies of the carbohydrate 
residues on the Ad35 E29 glycoprotein. The Ad35 glycopro- 
tein has been shown by sodium dodecyl sulfate-polyacryl- 
amide gel electrophoresis to migrate as a 29-kDa band 
because of the presence of four N-linked high-mannose 
oligosaccharides per molecule. The Ad3 glycoprotein has 
not been characterized, but its ORF also contains four 
putative glycosylation sites. In contrast, Ad2 E3-19K con-
tains two high-mannose oligosaccharides per molecule and
migrates as a 25-kDa protein (21). Hydrophobicity plots
generated for both Ad35 and Ad3 glycoproteins reveal that
both proteins have a typical NH2-terminal signal sequence, a
large hydrophobic transmembrane domain, and a short hy-
drophilic cytoplasmic tail, similar to Ad2 E3-19K (Fig. 3).
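
The Asn-X-Thr/Ser rule used above to count canonical glycosylation sites can be checked mechanically. The following Python sketch is illustrative only: the function name and the toy peptide are hypothetical, and excluding Pro at position X is a commonly applied refinement that the text itself does not state.

```python
import re

# N-X-S/T sequon. Excluding Pro at X is an assumption added here; the text
# above gives the rule only as Asn-X-Thr/Ser.
# The lookahead lets overlapping sequons (e.g., N-N-T-S) all be reported.
SEQUON = re.compile(r"(?=N[^P][ST])")


def glycosylation_sites(protein):
    """Return 1-based positions of Asn residues that begin an N-X-S/T sequon."""
    return [m.start() + 1 for m in SEQUON.finditer(protein.upper())]


if __name__ == "__main__":
    # Hypothetical peptide, not taken from any adenovirus sequence.
    print(glycosylation_sites("MNGTAANPSLLNNTS"))
```

A sequon count of this kind gives only putative sites; whether each site carries an oligosaccharide has to be established experimentally, as was done for the E29 glycoprotein.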

Both the Ad35 18.5-kDa and the Ad3 19.2-kDa ORFs 
exhibited much more limited homology to the Ad2 E3-19K 
ORF at the DNA (50%) and protein (30%) levels. This 
observation was consistent with our inability to demonstrate 
immunoreactivity between the Ad35 E29 glycoprotein and a 
polyclonal antibody to Ad2 E3-19K. However, comparison 
of the Ad35, Ad3, Ad2, and Ad5 MHC class I antigen- 
binding proteins has identified one prominent region of 
homology among all four proteins (Fig. 3 and 4). A con-
served stretch of 20 amino acids is located adjacent to the
transmembrane domain, within the lumenal globular domain
of the protein. In this region, 15 amino acids are identical and
3 amino acids are related (L-I, F-Y, A-T), whereas only 2
amino acids are not conserved among all four proteins. Since
group B and C glycoproteins all share the ability to bind to
MHC class I antigens, one hypothesis is that the conserved
domain represents a common binding site. This possibility is
currently being addressed by site-directed mutagenesis ex-
periments. In addition, there are five Cys residues which are
absolutely conserved, located at identical positions within
each protein. It is likely that these Cys residues are involved
in maintaining a structure that is conserved by critical 
disulfide bonds. 
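
The tally of identical, related, and nonconserved positions quoted above for the 20-amino-acid region can be reproduced for any set of aligned sequences with a short script. The sketch below is a hedged illustration: the function names and the toy alignment are hypothetical, the input is assumed to be gap-free and already aligned, and only the three pairs named in the text (L-I, F-Y, A-T) are treated as related.

```python
# Residue pairs treated as "related" in the text: L-I, F-Y, A-T.
RELATED_GROUPS = [frozenset("LI"), frozenset("FY"), frozenset("AT")]


def classify_column(residues):
    """Classify one alignment column as identical, related, or variable."""
    distinct = set(residues)
    if len(distinct) == 1:
        return "identical"
    if any(distinct <= group for group in RELATED_GROUPS):
        return "related"
    return "variable"


def summarize(aligned):
    """Count column classes over equal-length, gap-free aligned sequences."""
    if len({len(seq) for seq in aligned}) != 1:
        raise ValueError("aligned sequences must have equal length")
    counts = {"identical": 0, "related": 0, "variable": 0}
    for column in zip(*aligned):
        counts[classify_column(column)] += 1
    return counts


if __name__ == "__main__":
    # Hypothetical four-sequence alignment, not the real E3 glycoproteins.
    toy = ["ALFKCA", "AIFKCT", "ALYRCA", "ALFKCA"]
    print(summarize(toy))
```

Applied to the real four-protein alignment, a column-wise count of this kind is what yields figures such as 15 identical, 3 related, and 2 nonconserved positions.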

Further comparison of the four class I antigen-binding 
proteins has revealed that the last three COOH-terminal 
amino acids, Lys-Met-Pro, are absolutely conserved. The 
cytoplasmic-tail amino acids have been shown to be impor-
tant for localization of the Ad2 glycoprotein to the ER.
Deletion of the last eight amino acids is sufficient to allow
Ad2 E3-19K to move out of the ER (25). Our current data
suggest that three residues may be sufficient for retention of
the glycoproteins in the ER or that the three amino acids are
an important conserved domain of the octapeptide ER
retention signal.

In addition, we have identified Ad35 E3 ORFs of 20.3 and 
20.6 kDa downstream from the 18.5-kDa ORF. Both of these 
ORFs are strongly predicted by TESTCODE analysis to be 
coding (10). Hydrophobicity plots of the predicted amino 
acid sequences indicate that both ORFs have features typical 
of transmembrane proteins (data not shown). Both proteins 
coded by the 20.3- and 20.6-kDa ORFs were predicted to be 
heavily glycosylated because the sequences contain four and 
five putative glycosylation sites, respectively. Despite these 
similarities to the E29 protein, neither ORF exhibited any 
homology to the 18.5-kDa ORF at either the DNA or protein 
level. The 20.3- and 20.6-kDa ORFs may be distantly related 
to each other, but the alignment at either the protein (27%) or 
DNA (41%) level was not statistically significant.
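
A hydrophobicity plot of the type referred to above is a sliding-window average of per-residue hydropathy values. The sketch below assumes the Kyte-Doolittle scale and a 19-residue window; neither choice is stated in the text, and the function name and example sequences are hypothetical.

```python
# Kyte-Doolittle hydropathy values (assumed scale; the text does not say
# which scale was used for the plots).
KD = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}


def hydropathy_profile(protein, window=19):
    """Return the mean hydropathy of each full window along the sequence.

    Raises KeyError for residues outside the 20 standard amino acids.
    """
    protein = protein.upper()
    if window < 1 or window > len(protein):
        return []
    scores = [KD[aa] for aa in protein]
    return [
        sum(scores[i:i + window]) / window
        for i in range(len(protein) - window + 1)
    ]


if __name__ == "__main__":
    # Hypothetical sequence: a polar stretch flanking a hydrophobic core.
    toy = "KKDDEE" + "LIVLLAVILLAVLIVLLAV" + "RRKKDD"
    profile = hydropathy_profile(toy)
    print(max(profile), min(profile))
```

In such a profile, a sustained run of strongly positive window averages is the usual signature of a candidate transmembrane segment, which is the feature the plots were used to identify.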

Both the Ad35 E3 20.3- and 20.6-kDa ORFs are analogous 
to two ORFs in the group B Ad3 E3 unit. Specifically, the
Ad35 20.3-kDa ORF showed 79% identity with the Ad3
20.1-kDa ORF at the DNA level and a 76% alignment at the
protein level. The Ad35 20.6-kDa ORF was 70% homologous
to the Ad3 20.5-kDa ORF at the DNA level, and the
predicted proteins were 60% identical. Neither group C Ad2
nor Ad5 had an analogous 20-kDa ORF within its E3 region,
which was consequently 1 kb smaller. The function of the
unique group B E3 proteins remains unclear. In addition, a 
6.4-kDa Ad35 ORF was present in another reading frame 
within the same DNA sequence as the 18.5-kDa ORF. It had 
no analog within the E3 units of the other serotypes se- 
quenced, and TESTCODE analysis yielded no prediction in 
terms of probability of coding. 
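
Percent identity values such as those quoted above depend on the alignment and on how gapped columns are counted. The sketch below shows one common convention, in which identity is computed over columns where neither sequence has a gap; the convention, the function name, and the toy sequences are assumptions and are not taken from the methods used here.

```python
def percent_identity(a, b, gap="-"):
    """Percent identity over aligned columns in which neither sequence is gapped.

    Both inputs must already be aligned and therefore of equal length.
    """
    if len(a) != len(b):
        raise ValueError("aligned sequences must have equal length")
    compared = 0
    matches = 0
    for x, y in zip(a.upper(), b.upper()):
        if x == gap or y == gap:
            continue  # gapped columns are excluded under this convention
        compared += 1
        if x == y:
            matches += 1
    if compared == 0:
        return 0.0
    return 100.0 * matches / compared


if __name__ == "__main__":
    # Hypothetical aligned DNA fragments.
    print(round(percent_identity("ACGT-ACGTA", "ACGTTACCTA"), 1))
```

Counting gapped columns as mismatches instead would lower the figure, so the convention should always be reported alongside the percentage.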

Unexpectedly, the analog of the Ad2 11.6-kDa and the
Ad3 9-kDa ORFs was absent from its anticipated location in
the Ad35 E3 region. Instead, directly downstream from the
20.6-kDa ORF, a DNA sequence closely related to the Ad3
10.2-kDa ORF was present (Fig. 1 and 2). The Ad35 E3
sequence data extend approximately two-thirds into this
ORF. Since both complementary Ad35 DNA strands were
sequenced at least once and the deletion observed corre-
sponds specifically to a single complete ORF, the absence of
the E3 ORF is unlikely to represent a sequencing or cloning
artifact.

It is intriguing that the equivalent of the Ad2 11.6-kDa 
ORF was absent from the Ad35 E3 region. The Ad2 protein 
has previously been shown to be expressed in infected cells
by analysis with antibody to synthetic peptides. An
analogous 9-kDa ORF is present in Ad3, and a related
7.7-kDa ORF has been identified in group B Ad7 (J. Engler,
personal communication). Perhaps the absence of an analo-
gous ORF in Ad35 is related to the unique epidemiology of
Ad35 infections. Ad35 has been isolated primarily from
immunosuppressed hosts such as patients with AIDS, bone
marrow transplant recipients, and children with severe com-
bined immunodeficiency disease. Ad35 infection usually
causes asymptomatic viruria but occasionally has resulted in
fatal pneumonias. Perhaps the Ad2 11.6-kDa ORF codes for
a virulence factor which is absent from Ad35. On the other
hand, it may not be an essential protein in the adenovirus life
cycle in vivo. Less likely, the ORF may be present in an
aberrant location within the Ad35 E3 region and therefore
was not included in the sequence data presented here. 

In addition, the Ad35 E3 sequence does not contain the 
polyadenylation [poly(A)] consensus signal that would be
predicted by comparison with the Ad2 E3 region. The Ad2
and Ad5 E3 units have been shown to encode multiple 
overlapping mRNAs; the mRNAs are separated into two 






FIG. 2. Comparison of the E3 regions of Ad3 and Ad35. Alignment
of the Ad35 sequence within the 4-kb Ad3 E3 [...]
nucleotides which code for a 9-kDa [...] the Ad3
sequence. Boxes represent ORFs larger than 6 kDa. F1, F2, and F3
are the three reading frames; K, kilodaltons. Key: DNA, homologous
or absent; protein, proposed or proven ORF.

families, E3A and E3B, on the basis of the use of two different
poly(A) sites (4, 5). The major Ad2 E3 mRNA transcribed,
which codes for E3-19K, uses the first poly(A) site down-
stream from the 11.6-kDa ORF; the poly(A) site is preceded
by the hexanucleotide signal sequence ATTAAA. The con-
sensus nucleotide sequence AATAAA is present at the
second poly(A) site, downstream from the Ad2 3'-terminal
ORF. In the Ad3 E3 region, the sequences AATAAA and

FIG. 4. Alignment between the Ad35, Ad3, Ad2, and Ad5 MHC
class I antigen-binding proteins. Identical residues are indicated by
a colon. Symbols: □, conserved domains; [...], Cys residues which
are absolutely conserved among the four proteins.

ATTAAA are present close together within the 20.1-kDa
ORF (at nucleotides 1996 and 2008), and an atypical se-
quence, ATGAAA, is present at the end of the 20.5-kDa ORF
(nucleotide 3016); one of these sequences may function as a
poly(A) site for the Ad3 19.2-kDa ORF. In contrast, none
of these sequences is present in
the Ad35 E3 sequence. It is possible that the absence of a
poly(A) site adjacent to the Ad35 18.5-kDa ORF could result
in less efficient transcription. This hypothesis is consistent
with our data that show that the Ad35 E29 glycoprotein is
produced in significantly smaller amounts compared with
Ad2 E3-19K. For instance, studies of Ad2 E3 deletion
mutants have demonstrated that the elimination of the first
poly(A) site or other defined sites downstream from the
19-kDa ORF can decrease levels of Ad2 E3-19K mRNA (2).
Alternatively, a novel sequence which can function as a
poly(A) site for the 18.5-kDa mRNA may be present in the
Ad35 E3 region.
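
A search for the hexanucleotide signals discussed above (the AATAAA consensus, the ATTAAA variant, and the atypical ATGAAA) amounts to a simple pattern scan. The sketch below is a minimal illustration; the function name and the toy sequence are hypothetical, and only the strand supplied is scanned.

```python
import re

# Hexanucleotides named in the text: the AATAAA consensus, the ATTAAA
# variant, and the atypical ATGAAA.
POLYA_SIGNALS = ("AATAAA", "ATTAAA", "ATGAAA")


def find_polya_signals(dna, signals=POLYA_SIGNALS):
    """Return sorted (1-based position, hexanucleotide) pairs found in dna."""
    dna = dna.upper()
    hits = []
    for signal in signals:
        # A lookahead is used so that overlapping occurrences are reported.
        for m in re.finditer("(?=" + signal + ")", dna):
            hits.append((m.start() + 1, signal))
    return sorted(hits)


if __name__ == "__main__":
    # Hypothetical sequence, not taken from any E3 region.
    print(find_polya_signals("GGAATAAACCCATTAAAGG"))
```

The presence of a hexanucleotide does not by itself establish a functional poly(A) site, and its absence does not exclude a novel signal, which is the alternative raised in the preceding paragraph.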

The functions of the E3 proteins in adenovirus pathogen-
esis remain unclear. The MHC class I antigen-binding glyco-
protein may affect virulence or facilitate latency, but direct
evidence is lacking. Although recent studies of infection of
the cotton rat with Ad5 mutants have indicated that deletion
of E3 enhances virulence, similar experiments with hamsters
have yielded contradictory results (13, 24). We are currently
investigating an alternative model, using a mouse adenovirus
in its natural host (15). Further studies will also provide
important information in regard to the safety of using an
adenovirus E3 deletion mutant as a live-virus vaccine vector
in humans, as has been proposed elsewhere (24).